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Apparatus and Method for resource arbitration 



TECHNICAL FIELD 

The present invention relates generally to systems whose resources include contending 
requesters and servers, and more particularly to a method and apparatus for arbitrating 
suitable matches between such requesters and servers. 

BACKGROUND OF INVENTION 

In many information processing systems having a number of system resources, often 
called servers, the situation frequently arises wherein a set of requesters must arbitrate for 
service from one of a number of servers. For instance, in a data packet switching 
application, the requesters may each need to transmit one or more cells (i.e., small 
packets of information) to various ones of the servers. Each requester receives cells from 
one or more system users (e.g., components) that need to be forwarded to specified one of 
the servers. If the requester has at least one cell that needs to be forwaided to a server, 
then the requester will generate a request for access. The problem is to rapidly produce a 
set of one-to-one (point to point) matchings that maximizes the number of connections 
between requesters and servers. Preferably, the matchings should be produced without 
imposing significant overheads on the system. 

Although the subsequent material herein describes a local area-switching network, the 
present invention may be applied in any application in which a rapid matching of 
requestors and servers is needed. 

The present invention has application in an arbitration mechanism and procedure that 
provides maximal matching v^thin a relatively small number of clock cycles. Fast 
matching is important so that the arbitration procedure can be repeated quickly for 
successive sets of requests. In particular, the arbitration procedure for one set of requests 
should be no longer than the amount of time needed to transmit the requests granted 
during the previous arbitration cycle. When the requests are small cells that are 
transmitted in just a few clock cycles, the arbitration mechanism has only a similar 
number of clock cycles to arbitrate competing requests. The arbitration mechanism of the 
present invention can be implemented using presently existing circuit components. 

In one application of the invention each input port receives, and stores in an input buffer, 
a set of cells, with each cell member bearing an address that identifies an output port to 
which the cell requests routing. The arbitration mechanism now must satisfy not one 
request, but a set of requests. This is a relatively difficult and time-consuming task 
because many arbitration matches are possible between cells in the input port sets and the 
output ports. Some matches will be better than others will; the problem is to rapidly find 
a good match satisfying the largest number of contending requests. 



Crossbar Arbitration 



Switching (i.e., connecting requesters to servers, or transmitting data from senders to 
receivers) involves two separate tasks. First, a scheduling task is performed, wherein an 
arbitration mechanism selects which of potentially many requests to send to each server. 
Second, a data-forwarding task is performed, in which a switching mechanism forwards 
requests or cells to servers in accordance with the selections made by the scheduling task. 

Bipartite Graph Matching 

At the center of high-speed communication switches, there is generally a crossbar matrix 
switch to do the actual switching at the physical layer. Crossbar switches enable point-to- 
point connections to be configured to connect one switch port to another. The crossbar 
has to have connections formed across it by some form of algorithm. No two inputs can 
connect to the same output (fig: 1®) and no two outputs can receive different cells from 
the same input (fig: 1®), the acceptable solutions are fig: 1(D®. This set of connections 
must be produced from a set of requests that come from the input ports. This is 
commonly known as a Bipartite Graph Matching problem. There are many different 
solutions to this problem. Each solution addresses a different weakness and attempts to 
resolve it. 

Pointers 

Nearly all Bipartite Graph Matching algorithms involve some mention of pointers, so 
some explanation is needed. Each input and output port has a pointer that indicates where 
the arbiter will attempt to construct connections. In most algorithms there is both grant 
and an accept pointer, these are associated with output and input ports respectively. It is 
with the manipulation of these pointers that most of the algorithms are concerned. 

Fairness 

All Bipartite Graph Matching algorithms attempt to allocate potential connections fairly. 
In practice, this means that over a statistically significant period of time that potential 
connections will be distributed evenly between all of the ports with connection requests. 

Controllability 

The definition of faimess used up to this point has been that bandwidth should be 
allocated as evenly as possible between the ports requesting connections. This definition 
is overly simplistic; in the real world, this definition becomes much too rigid. What is 
really needed is the ability to control what bandwidth is allocated where and to whom. It 
was with this aim in mind that Probabilistic Masking was developed (GB.9828 144. 76-22- 
12-98. Data Switching Apparatus) where connection bandwidth allocation is performed 
by a Probabilistic Masking unit placed before the arbitration stage. Probabilistic Masking 
functions by (Pseudo) randomly removing connection requests to a varying degree before 
they reach the arbitration stage, thus bandwidth on that particular connection is limited 
enforcing varying levels of Quality of Service. The Probabilistic Masking arrangement 



has a limitation that it does not operate in a work conserving manner, when a request is 
masked potential bandwidth is lost, this bandwidth is not then allocated elsewhere. 

It is an object of the invention to provide an arbitration arrangement that overcomes this 
limitation. 

SUMMARY OF THE INVENTION 

According to the invention there is provided an arbitration arrangement for use in 
selecting the connections to be made between ingress and egress ports of a memoryless 
cross-bar switch of a data switching system, the arbitration arrangement comprising a 
three phase process involving (i) a request phase in which each ingress port sends its 
connection requests to the egress ports to which a connection is required, (ii) a grant 
phase in which each egress port examines the requests directed to it and selects one 
request for grant returning a grant signal indicative of the selected request and directed to 
the appropriate ingress port and (iii) an accept phase in which each ingress port examines 
the received grant signals and selects one to accept thereby defining an ingress to egress 
port connection across the memoryless cross-bar switch, characterised in that each 
ingress port is allocated a default weight which is decremented by one each time a 
connection is made in the cross-bar switch involving that ingress port and a connection 
request is only generated in the request phase if the value of the weight is more than zero 
and the weights are reset to their default value when all requests to an egress port have 
had their weight values decremented to zero. 

According to a feature of the invention there is provided an arbitration arrangement in 
which each ingress port request is applied to a masking element comprising a default 
weight register, programmed with the default weight to be allocated to the said ingress 
port by a system control device, a working register whose contents are decremented by 
one each time a cross-bar switch connection is made involving the said ingress port, the 
contents of the working register being used to mask the connection request generated by 
the ingress port when said contents are zero. 

Also according to the invention there is provided an . arbitration process for use in 
selecting the connections to be made between ingress and egress ports of a memoryless 
cross-bar switch of a data switching system the arbitration process comprising a three 
phase process involving (i) a request phase in which each ingress port sends its 
connection requests to the egress ports to which a connection is required, (ii) a grant 
phase in which each egress port examines the requests directed to it and selects one 
request for grant returning a grant signal indicative of the selected request and directed to 
the appropriate ingress port and (iii) an accept phase in which each ingress port examines 
the received grant signals and selects one to accept thereby defining an ingress to egress 
port connection across the memoryless cross-bar switch, characterised in that each 
ingress port is allocated a default weight which is decremented by one each time a 
cormection is made in the cross-bar switch involving that ingress port and a connection 
request is only generated in the request phase if the value of the weight is more than zero 
and the weights are reset to their default value when all requests to an egress port have 
had their weight values decremented to zero. 



Functional Description 



The new arbitration algorithm's function is to take a data set of connection request bits 
and form connections obeying the rules described above in the crossbar arbitration 
section. When viewed over a statistically significant amount of time the connections 
formed by the algorithm are made more or less likely to be formed depending on the 
weight allocated to that particular link. The higher the weight for a connection the more 
likely it is for that particular connection to be formed. This ability to define the likelihood 
of connections being made enables deterministic work-consei*ving bandwidth 
management. 

Bandwidth Management 

The new algorithm incorporates all of the functionality that is provided by a Probabilistic 
Masking unit into the arbitration stage itself In addition, to avoid all of the problems 
associated with probabilistic operation the new arbitration stage behaves in a 
deterministic way. This is achieved by taking an arbitration stage and adding extra 
functionality in the form of weights for each connection. When a connection is made, the 
weight is decremenied, if there is no remaining weight the connection cannot be made. 
This extension gives ail of the traffic shaping ability of probabilistic masking, while 
maintaining the work conserving nature of the arbitration stage. 

Deterministic Work Uonserving Operation 

The masking effect of the weights is similar to that of Probabilistic Masking; however, 
the new algorithm behaves in a deterministic way. In addition, any residual bandwidth 
that is not being used is allocated in a work-conserving manner. Bandwidth Allocation 
guarantees that a minimum level of service will be delivered under high load. Work 
conserving operation allows the switch to reduce latency under lower loads. 

Variable Granularity 

The granularity of bandwidth allocation is controlled by the range of integer values that 
the weighting system can hold, the larger the values the finer the granularity. As the only 
limitation on the weight values is the size of the register that is used to hold their values, 
hence, the granularity of bandwidth is easily altered by simply using more bits. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1: Legal arbitration states. This shows the different possible connection 
combinations for a two-port switch. 

Figure 2: Weight Reset Conditions. This gives an example of weight/request masking. 

Figure 3: Single output port request masking process. This details the entire arbitration 
process for one port. 



Figure 4: shows an example of a masking element associated with a request signal. 
Figure 5: shows weight reset arrangements for one egress port. 

DETAILED DESCRIPTION OF THE PREFERRED ALGORITHMIC 
EMBODIMENT 

The current invention is based upon a three-stage arbitration algorithm. The first and third 
stages remain essentially the same; the second or grant stage is altered to incorporate the 
connection bandwidth allocation functionality. 

Stage 1 Request 

For a thirty-two-port system, there are one thousand and twenty four request signals 
(thirty-two squared); this allows requests for connections from any point to any other 
point to be requested. Creation of requests is an external matter and is handled by another 
mechanism. 

Stage 2 Grant 

For each port, two processes are performed to establish which request to grant. Before the 
round robin process can be used to establish a grant, the weight table for each port must 
be checked to see if it should be reset to its original allocation values. The reset condition 
is that every port with a request has a weight of zero (figure 2). The lack of a request 
from a particular input port effectively masks the weight for that port, removing it from 
the reset calculation. A reset consists of every weight for a particular port being reloaded 
with its original value. These values are held in another set of registers/RAM that can be 
written to, to alter the bandwidth allocation parameters of the arbiter. 

The second process is a Round Robin with an extra grant condition that is dependent 
upon the weights for the connection. As the pointer moves around the set of requests, for 
a »equest to be granted there must also be enough weight for that particular connection 
i.e. a value greater than zero. The weights in effect temporarily mask certain requests. 
Although requests are being rejected, this algorithm remains work conserving by ignoring 
weights that have no requests relating to them. If there are only request with no weight 
the weights will be reset (see above) and connections will be made. This work conserving 
nature guarantees a minimum bandwidth at high load, but allocates more if it becomes 
available. 

The operation of this masking effect is shown in figure three, a single output port from a 
four-port arbiter. The requests from each input port ® are shown, these are then 
individually ANDed with the weights ® for each input port. The result (D is then 
presented to a RR pointer for a request to be granted. It can be seen that the pointer 
moves round from the last location ® to the next location with a valid request.®.. 



stage 3 Accept 



In the third stage of the arbitration, each port that has received a grant © uses a round 
robin ® to pick which one to accept ® . Once an input and an output port have negotiated 
a connection, they then set the appropriate blocked flags, eliminating themselves from the 
selection process. Once a connection has been confirmed, the weight for that connection 
is decremented by one ®. The grant and accept pointer values for the connection 
concemed are then updated ®. 

The masking effect of the weights is similar to that of Probabilistic Masking; however, 
the new algorithm behaves in a deterministic way. 

DETAILED DESCRIPTION OF THE PREFERRED REGISTER LEVEL 
EMBODIMENT 

Changes to the second part of the arbitration process detailed above are implemented in 
the form of a masking element that is inserted in front of the grant round robin stages. In 
a thirty-two-port arbitration stage, there would need to be one thousand and twenty four 
masking elements. Figure 4 shows the Register Transfer Level (RTL) design of one 
individual masking element. Figure 5 shows the construction of one egress port from 
thirty- two masking elements. 

Masking Element Operation 
This sequence references Figure 4. 

1 . The weighting for the one request is loaded into the default weight register (I). 
First cycle: 

2. This default weight held by (1) is then transferred onto the working register (3), via 
the muliiplexor (2), on the clock PGT that signifies the start of the first cycle. 

3. The non-zero status of the weight in the working register (3) is determined by the OR 
gate (4). 

4. The status determined by (4) is then used to mask out the request if the status of the 
working register (3) is zero. This is performed by the AND gate (5). 

5. The value of the weight in working register (3) is passed to the Decrementer (8) 
where if a connection has been received due to a request made on the previous cycle 
the value of the weight is decremented by one and returned to working register (3), 

6. The non-zero status of the decremented weight is determined by the OR gate (6). 

7. If there is no request in this cycle and the weight value has been decremented to 
produce a non— zero result as determined by (6) then the reset condition for this 
request has been met. This product function is provided by (7). 



When this condition has been met for all requests to a single Egress port, the default 
weights for all requests to this Egress port are required to be reloaded into their 
relevant working registers (3). This is determined by (9), which has a similar 
connection to all other masking elements for this Egress port. 
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Figure 1: Legal/Illegal arbitration states 
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Figure 2: Weight Reset Conditions. 
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Figure 3: Single output port request masking process. 
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Figure 4: Single Masking Element 




Figure 5: Single Egress Port with Masking Elements 



